Higgs boson hadronic branching ratios at the ILC

We present a study of the Higgs boson decay branching ratios to bb, cc and gluons, one of
the cornerstones of the physics program at the International Linear Collider (ILC). A standard
model Higgs boson of 120 GeV mass, produced in the Higgs-strahlung process at √s = 250 GeV,
was investigated using full detector simulation and reconstruction procedures. The analysis was
performed in the framework of the Silicon Detector (SiD) concept with full account of inclusive
standard model backgrounds. The selected decay modes contained two heavy flavour jets in the
final state and required excellent flavour tagging through precise reconstruction of interaction and
decay vertices in the detector. A new signal discrimination technique using correlations of neural
network outputs was used to determine the branching ratios and estimate their uncertainties, 4.8%,
8.4% and 12.2% for bb, cc and gluons respectively.

PACS numbers: 14.80.Bn,14.80.Cp 
I. INTRODUCTION

The measurement of the Higgs absolute branching ratios in all possible decay modes is one of the most important parts of the ILC program, giving a precision test of the standard model (SM) prediction that the Higgs boson couples to each particle in proportion to its mass [1]. While the Large Hadron Collider (LHC) is a likely discovery machine for the Higgs boson, it will be almost impossible to determine the total Higgs cross section there, making it difficult to make absolute measurements of partial decay widths. On the other hand, precise and model-independent measurements of absolute branching ratios can be performed at the ILC [2].

In this paper the Higgs decay modes to charm quarks, bottom quarks and gluons, which result in two- and four-jet final states, are considered. Heavy quarks, b and c, couple directly to the Higgs, while the Higgs decay to gluons in the standard model is mediated by heavy quark loops. The branching ratio to gluons is indirectly related to the ttH Yukawa coupling [3] and would probe the existence of new strongly interacting particles that couple to the Higgs and are too heavy to be produced directly. Precise measurements of the branching ratios will help to discriminate between different 'Beyond the SM' (BSM) scenarios [4, 5] where the Higgs couplings differ from the SM Higgs couplings.

The production cross sections, branching ratios and 
decay widths of the Higgs are determined by the strength 
of the Yukawa couplings to fermions and gauge bosons, 
whose scale is set by the masses of these particles. For 
a Higgs boson mass of 120 GeV, the expected branching 
ratios to bottom quarks, charm quarks and gluons are 
67.9%, 3.1% and 7.1%, respectively. 

While the Higgs branching ratios in the e+e− collider environment have been studied before [6], to date this has been done in a parametric way from the detector point of view, exploiting neither full Monte Carlo (MC) simulations nor realistic reconstruction algorithms. It should be noted that this measurement is a complex analysis which relies heavily on bottom and charm quark tagging. Possible detector resolution and acceptance effects, together with non-ideal efficiency and purity of the reconstruction algorithms, could play a critical role in the ultimate sensitivity of the experiment and hence in its physics reach. This study addresses these issues for the first time in the framework of the Silicon Detector (SiD) concept, as one of the benchmarking analyses prepared for the SiD Letter of Intent [7]. A new technique employing two neural networks was used to overcome one of the main difficulties of the analysis: separation of the signal from the background of Higgs decays to other particles.

The paper is organized as follows: the Higgs boson production at the ILC and the SiD detector concept are introduced in Section II. Sections III and IV describe the details of the analysis and its results, which are summarized in Section V.



II. HIGGS PRODUCTION AT THE ILC 



A. The ILC 

The ILC is a next-generation electron-positron accelerator designed to collide particles at a centre-of-mass energy of up to 500 GeV, upgradeable to 1 TeV, with a peak luminosity of 2 × 10³⁴ cm⁻² s⁻¹. The dominant production mechanisms for the SM Higgs at e+e− colliders are the Higgs-strahlung [8, 9] and the WW and ZZ fusion processes [10]:

\begin{equation}
\begin{aligned}
e^+e^- &\to ZH \to f\bar{f}H \\
e^+e^- &\to \nu\bar{\nu}W^*W^* \to \nu\bar{\nu}H \\
e^+e^- &\to e^+e^-Z^*Z^* \to e^+e^-H
\end{aligned}
\tag{1}
\end{equation}

The Higgs-strahlung and WW fusion processes are both capable of generating ννH final states.



B. The SiD Detector Concept 

The sensitivity of the Higgs decay branching ratio measurement was studied in the framework of the Silicon Detector (SiD) concept [7] using full detector simulation and event reconstruction. SiD is designed for precision measurements of a wide range of possible new phenomena at the ILC. It is based on a silicon pixel vertex detector, silicon tracking, silicon-tungsten electromagnetic calorimetry, and highly segmented hadronic calorimetry. The Particle Flow Algorithm (PFA) approach [7] is an important strategy driving the basic philosophy and layout of the detector. SiD also incorporates a 5 T solenoid, an iron flux return and a muon identification system. A schematic view of a SiD quadrant is shown in Figure 1.

The vertex detector comprises a central barrel with five silicon pixel layers and four silicon pixel disks for the forward and backward regions, providing an impact parameter resolution of 5 ⊕ 10/(p sin^(3/2) θ) [μm] and good hermeticity for cos θ < 0.984, where θ is the polar angle.
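As a numerical illustration of the vertex detector resolution, the short sketch below evaluates 5 ⊕ 10/(p sin^(3/2) θ) μm. It assumes that ⊕ denotes a sum in quadrature, that p is given in GeV/c, and that the power of sin θ is 3/2. The function name and default arguments are choices made for this example and are not part of the SiD software.

```python
import math


def impact_parameter_resolution_um(p_gev, theta_rad, a_um=5.0, b_um_gev=10.0):
    """Return a (+) b / (p sin^{3/2} theta) in micrometres, added in quadrature.

    a_um is the asymptotic (high momentum) term and b_um_gev the multiple
    scattering term; theta_rad is the polar angle of the track in radians.
    The quadrature reading of the (+) symbol is an assumption of this sketch.
    """
    scattering_term = b_um_gev / (p_gev * math.sin(theta_rad) ** 1.5)
    return math.hypot(a_um, scattering_term)


if __name__ == "__main__":
    # Resolution for tracks at 90 degrees and increasing momentum.
    for p in (1.0, 10.0, 100.0):
        sigma = impact_parameter_resolution_um(p, math.pi / 2)
        print(f"p = {p:6.1f} GeV/c  ->  sigma = {sigma:.2f} um")
```

The multiple scattering term dominates at low momentum and at shallow polar angles, while the constant term sets the limit for high momentum tracks.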

SiD has adopted silicon strip technology for its tracker, arranged in five cylinders and four endcaps, as needed for precision tracking and momentum measurement. Given an outer cylinder radius of 1.25 m, the charged track momentum resolution is expected to be better than σ(1/pT) = 5 × 10⁻⁵ (GeV/c)⁻¹ for high momentum tracks.

Calorimetry in SiD is optimized for jet energy measurements and is based on the Particle Flow strategy, in which the momenta of charged particles are measured in the tracker, the momenta of neutrals are measured in the calorimeters, and the charged and neutral components are then combined. The SiD calorimetry begins with a dense, highly pixelated silicon-tungsten electromagnetic calorimeter with an energy resolution of 11%/√E. The Hadronic Calorimeter (HCAL) has a depth of 4.5 λ of stainless steel, divided into 40 layers of steel and detector. A good jet energy resolution, about 4%, allows the invariant masses of W's, Z's, and top quarks to be reconstructed with resolution nearing the natural widths of these particles.



III. ANALYSIS



A. Analysis Framework

Both the signal and background events were produced at a centre-of-mass energy of √s = 250 GeV, for a total integrated luminosity of 250 fb⁻¹ and a Higgs mass of 120 GeV. The choice of energy in this analysis maximizes the Higgs-strahlung cross section. Only the HZ process was considered, as it accounts for about 95% of the total Hff cross section for this choice of energy and Higgs mass. SM events (mainly WW, ZZ and qq pairs) and Higgs decays to fermions other than the signal were considered as backgrounds. All 0, 2 and 4 fermion final states were generated using the WHIZARD Monte Carlo Event Generator [11], taking into account beamstrahlung and initial state radiation (ISR) photons [7]. PYTHIA [12] was used for the final state QED and QCD parton showering, fragmentation and decay to provide final-state observable particles. For this study, event samples were created conforming to the expected ILC baseline parameters of ±80% electron and ∓30% positron polarization. About 7M events were processed through the full detector simulation, with individual events weighted to reflect the statistical sampling.

The detector response to generated events was simulated using the Geant4 toolkit [13, 14], which provided the necessary classes to describe the geometry of the detector and the transport and interaction of particles with materials and fields. A thin layer of Linear Collider specific code, SLIC [15], provided access to the Monte Carlo events, the detector geometry and the output of the detector hits. The detector parameters could be varied without having to rebuild the simulation executable binaries since the geometries were fully described at runtime. The output was in the standard LCIO format [16], so that detector concepts using other simulation packages could be studied and data generated using this system could be analyzed in other analysis frameworks.

The identification of jets is an important part of this analysis. The fragmentation products of the hadronic systems were forced into either two or four jets, depending on the final state, using the DURHAM algorithm [17]. To provide the most probable kinematic configuration of the event topology, a kinematic fitter, Marlin Kinfit [18], with four-momentum and mass constraints was used. The fitter uses the method of Lagrange multipliers to determine the most probable value for the jet four-momentum.

In order to identify primary, secondary and tertiary vertices, the topological vertex finder ZVTOP was utilized. The algorithm is part of a vertexing package developed by the LCFI collaboration [19]. It classifies events according to the number of vertices found and combines eight optimized variables for each type of event in a neural network, which is then separately trained on samples of b, c and light quarks. The best discriminating variables were the corrected vertex mass, secondary vertex probability, impact parameter significance and the number of vertices in the event.



B. Event Selection 

The analysis signature depends on the Z boson decay products (charged leptons, hadrons or neutrinos). The channels studied in this analysis were the neutrino mode (Z decaying to neutrinos) and the hadronic mode (Z decaying to hadrons), referred to below as the signal.

FIG. 1. A plane view of a quadrant of SiD. Dimensions are in mm.

The events were classified into the two channels using the number of leptons and the visible energy in the event. Leptons were defined as reconstructed electrons or muons with momenta larger than 15 GeV. The visible energy was defined as the sum of the energies of all reconstructed particles [7] in the event. Figure 2 shows the distributions of the visible energy and the number of leptons for the signal, Higgs background and SM background before any selections, normalized to 250 fb⁻¹.

In the neutrino mode, events not having leptons and 
with visible energy in the 90 to 160 GeV interval were 
selected. In this channel all reconstructed particles were 
clustered into two jets which, for the signal, were as- 
sumed to come from the Higgs boson recoiling against 
two neutrinos from the Z boson decay. The most im- 
portant backgrounds in this channel were two-fermion 
events, ZZ pairs decaying to neutrinos and hadrons, and 
WW pairs where one W decays hadronically and the 
other W decays into a neutrino and a lepton, which es- 
capes undetected. 
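The channel classification described above can be sketched in a few lines of Python. The thresholds (lepton momentum above 15 GeV, visible energy between 90 and 160 GeV for the neutrino mode, above 170 GeV for the hadronic mode) are taken from the text; the `Particle` record, the function names and the treatment of the interval boundaries as inclusive are assumptions made for this illustration only.

```python
from dataclasses import dataclass
from typing import List, Optional

LEPTON_MIN_MOMENTUM_GEV = 15.0  # leptons are electrons or muons above this momentum


@dataclass
class Particle:
    """Hypothetical reconstructed particle record used only in this sketch."""
    kind: str        # e.g. "electron", "muon", "photon", "hadron"
    energy: float    # GeV
    momentum: float  # GeV


def count_leptons(particles: List[Particle]) -> int:
    """Count reconstructed electrons and muons above the momentum threshold."""
    return sum(
        1
        for p in particles
        if p.kind in ("electron", "muon") and p.momentum > LEPTON_MIN_MOMENTUM_GEV
    )


def visible_energy(particles: List[Particle]) -> float:
    """Sum of the energies of all reconstructed particles in the event."""
    return sum(p.energy for p in particles)


def classify_channel(particles: List[Particle]) -> Optional[str]:
    """Return "neutrino", "hadronic" or None if the event fits neither channel."""
    if count_leptons(particles) > 0:
        return None
    e_vis = visible_energy(particles)
    if 90.0 <= e_vis <= 160.0:
        return "neutrino"
    if e_vis > 170.0:
        return "hadronic"
    return None
```

Events with visible energy between 160 and 170 GeV, or below 90 GeV, fall into neither channel in this sketch.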

FIG. 2. Visible energy (a) and the number of leptons per event (b) used for channel classification. Solid curves are the SM background, dashed curves are the inclusive Higgs sample and filled histograms are the signals. All histograms are normalized to 250 fb⁻¹.

A cut-based selection was performed to further reduce the backgrounds in this channel using the kinematic variables described below. The transverse momentum of each jet, pT, was calculated to reject SM background events, which are softer than signal events. The number of tracks per jet, n_tracks, was used to reject purely leptonic events, which were not part of the signal. The discrimination between the signal and backgrounds with a different number of jets was achieved by the y_min parameter, which corresponds to the minimum y-parameter for the two-jet hypothesis in the DURHAM algorithm. The variable −log(y_min) was used to exclude fully hadronic WW (and ZZ) events, which were four-jet events misidentified as two-jet events. Thrust [20] values were calculated since signal events are expected to be less boosted and more spherical than the background events. Values of |cos(θ_thrust)| were also used because signal events are expected to be produced more centrally in the detector, while the majority of the background processes have a strongly forward-peaked angular distribution. A large fraction of background events have back-to-back jets, whereas signal events were confined to a range of angular values due to kinematic constraints coming from ZH production. The di-jet invariant mass was used to reject low mass hadronic systems from WW, ZZ and two-photon events. Signal events rarely have hard photons, and this helps to reject both highly energetic initial state radiation (ISR) photons and hard photons from di-jet events which occur for the background. No photon isolation was required in this particular case. Figure 3 shows some of the distributions for the above variables used in the neutrino channel before any selections.
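To make the y-parameter of the DURHAM algorithm concrete, the sketch below implements the standard Durham distance y_ij = 2 min(E_i², E_j²)(1 − cos θ_ij)/E_vis² and a simple exclusive clustering that merges the closest pair until a fixed number of jets remains. This is a minimal pure-Python illustration, not the jet finder used in the analysis: four-vector addition is assumed for recombination, and returning the smallest distance among the final jets is one plausible reading of "y_min" rather than a definition confirmed by the text.

```python
import math
from itertools import combinations


def durham_y(p1, p2, e_vis):
    """Durham distance between two four-vectors given as (E, px, py, pz)."""
    n1 = math.sqrt(p1[1] ** 2 + p1[2] ** 2 + p1[3] ** 2)
    n2 = math.sqrt(p2[1] ** 2 + p2[2] ** 2 + p2[3] ** 2)
    if n1 == 0.0 or n2 == 0.0:
        return 0.0
    cos_theta = (p1[1] * p2[1] + p1[2] * p2[2] + p1[3] * p2[3]) / (n1 * n2)
    cos_theta = max(-1.0, min(1.0, cos_theta))  # guard against rounding
    return 2.0 * min(p1[0], p2[0]) ** 2 * (1.0 - cos_theta) / e_vis ** 2


def cluster_to_n_jets(particles, n_jets):
    """Merge the closest pair repeatedly until n_jets objects remain.

    Returns (jets, y_min), where y_min is the smallest Durham distance
    among the final jets (0.0 if fewer than two jets remain).
    """
    jets = [tuple(p) for p in particles]
    e_vis = sum(p[0] for p in jets)
    while len(jets) > n_jets:
        i, j = min(
            combinations(range(len(jets)), 2),
            key=lambda ij: durham_y(jets[ij[0]], jets[ij[1]], e_vis),
        )
        merged = tuple(a + b for a, b in zip(jets[i], jets[j]))
        jets = [jet for k, jet in enumerate(jets) if k not in (i, j)] + [merged]
    if len(jets) < 2:
        return jets, 0.0
    y_min = min(durham_y(a, b, e_vis) for a, b in combinations(jets, 2))
    return jets, y_min
```

Two well separated jets give a large y and hence a small −log(y_min), which is why an upper cut on −log(y_min) disfavours events whose true topology has more jets than the forced hypothesis.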

The hadronic mode was selected as events with no leptons and visible energy above 170 GeV. In this channel, events were forced to have four reconstructed jets. For the signal, two of the jets were required to have a di-jet invariant mass consistent with the Higgs boson mass and the other two a mass consistent with the Z boson mass. The main backgrounds for this channel were WW and ZZ pairs where both bosons decay to hadrons. For the pre-selection, we used kinematic selections defined similarly to those in the neutrino channel. The full list of selections in the neutrino and hadronic channels is given in Tables I and II. Table III shows the number of events before and after pre-selection cuts for the cc, bb and gg modes in the neutrino and hadronic channels. The number of events for the SM background is indicated in the same table.
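The text does not spell out how the four jets are assigned to the Higgs and Z candidates. One common approach, shown here purely as an illustrative assumption, is to try all pairings and keep the one whose di-jet masses are closest to the nominal Higgs and Z masses. The unweighted sum of squared mass differences, the Z mass value and all names below are choices of this sketch; the analysis itself relied on a kinematic fit.

```python
import math
from itertools import combinations

M_HIGGS_GEV = 120.0  # Higgs mass assumed in the study
M_Z_GEV = 91.19      # nominal Z mass, a value assumed for this sketch


def invariant_mass(*jets):
    """Invariant mass of the summed four-vectors, each given as (E, px, py, pz)."""
    e, px, py, pz = (sum(j[k] for j in jets) for k in range(4))
    m2 = e * e - px * px - py * py - pz * pz
    return math.sqrt(max(m2, 0.0))


def best_pairing(jets):
    """Return (higgs_indices, z_indices, chi2) for the best of the six assignments."""
    if len(jets) != 4:
        raise ValueError("exactly four jets are required")
    best = None
    for h_idx in combinations(range(4), 2):
        z_idx = tuple(k for k in range(4) if k not in h_idx)
        m_h = invariant_mass(jets[h_idx[0]], jets[h_idx[1]])
        m_z = invariant_mass(jets[z_idx[0]], jets[z_idx[1]])
        chi2 = (m_h - M_HIGGS_GEV) ** 2 + (m_z - M_Z_GEV) ** 2
        if best is None or chi2 < best[2]:
            best = (h_idx, z_idx, chi2)
    return best
```

In a realistic setting the two terms would be weighted by the respective mass resolutions; equal weights are used here only to keep the example short.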



As mentioned before, the LCFI vertexing package was used for vertex finding and identification of the flavour of hadrons in jets. The flavour identification is essential to differentiate b jets from c jets and from jets coming from light (u, d, s) quark hadronization. In analyses like the present one, where multi-b states are to be separated from significant backgrounds, a high b-tagging efficiency is required to discriminate against light and c-quarks. Furthermore, the measurement of the H → cc branching ratio requires efficient c-tagging with high rejection of b-quarks. The package produces three possible flavour tags, 'b-tag', 'c-tag' and 'c-tag on b-background only', for each jet in the event. Figure 4 shows the distribution of these variables for the leading jet in the hadronic channel for the cc decay mode after preselections.

The remaining events were categorized using neural networks implemented in FANN [21]. The neural networks (NN) used optimized sets of 18 input variables for the neutrino channel and 19 input variables for the hadronic channel. Each network was constructed out of three hidden layers with 28 neurons and had one output neuron. For the cc and bb signals, the first NN was trained to distinguish the SM background from the inclusive Higgs sample and to produce the NN_SM-Higgs output. In the gg case, the first NN was trained to distinguish the signal sample from the SM background and to produce the NN_sig-SM output. The second NN was, in all cases, trained to distinguish the signal from the inclusive Higgs background sample and to produce the NN_Higgs-signal output. The training was done separately for cc, bb and gg using independent samples. Figure 5 shows an example of the distributions of the first and second NN outputs in the cc decay mode for the neutrino channel. For the cc/bb/gg scenarios the signal was defined as H → cc/bb/gg events only and the Higgs background included all Higgs decays other than the signal ones.

FIG. 3. Distributions of pre-selection kinematic variables in the neutrino channel: (a) di-jet invariant mass, (b) number of charged tracks in the event, (c) jet pT, (d) highest reconstructed photon energy in the event. Solid curves are the SM background, dashed curves are the Higgs background and filled histograms represent the signal. All histograms are normalized to 250 fb⁻¹ and are shown before any selections.

       selection                                              value
(1)    20 GeV  < pT of jet                                    < 90 GeV
(2)    4       < number of charged tracks per jet
(3)              −log(y_min)                                  < 0.8
(4)              thrust                                       < 0.95
(5)              cos(θ_thrust)                                < 0.98
(6)    100°    < angle between jets                           < 170°
(7)    100 GeV < di-jet invariant mass                        < 140 GeV
(8)              highest reconstructed photon energy          < 10 GeV

TABLE I. Selections for the neutrino channel.

The final event samples were determined using a simultaneous selection on the first and second neural net outputs as specified in Table IV. The selection was performed separately for each channel and decay mode by finding an optimal point at which the signal cross section uncertainty was minimized. A detailed account of event selections and variable distributions for all decay modes in both channels is given in [22-24].
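The simultaneous selection on the two neural network outputs can be pictured as a two-dimensional threshold scan. The sketch below is a minimal illustration under stated assumptions: the figure of merit is taken to be the relative statistical uncertainty √(S+B)/S, events are represented as hypothetical (nn1, nn2, weight) tuples, and a plain grid scan stands in for whatever optimisation procedure was actually used.

```python
import math


def relative_uncertainty(n_signal, n_background):
    """Relative statistical uncertainty sqrt(S + B) / S (assumed figure of merit)."""
    if n_signal <= 0.0:
        return math.inf
    return math.sqrt(n_signal + n_background) / n_signal


def scan_thresholds(signal, background, step=0.01):
    """Scan cuts nn1 > cut1 and nn2 > cut2 on a regular grid.

    signal and background are lists of (nn1, nn2, weight) tuples.
    Returns (best_uncertainty, cut1, cut2) for the first grid point that
    minimises the relative uncertainty.
    """
    n_steps = int(round(1.0 / step))
    best = (math.inf, None, None)
    for i in range(n_steps):
        cut1 = i * step
        for j in range(n_steps):
            cut2 = j * step
            s = sum(w for a, b, w in signal if a > cut1 and b > cut2)
            bkg = sum(w for a, b, w in background if a > cut1 and b > cut2)
            unc = relative_uncertainty(s, bkg)
            if unc < best[0]:
                best = (unc, cut1, cut2)
    return best
```

Because the two cuts are optimised together, the scan can exploit correlations between the two outputs that would be missed by optimising each threshold separately.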



IV. BRANCHING RATIO CALCULATION 



The branching ratios of the Higgs boson decays were calculated using events that passed the final neural network selection. The calculation was done by normalising the signal cross section to the inclusive Higgs cross section.





       selection                                              value
(1)    4       < number of charged tracks per jet
(2)              −log(y_min)                                  < 2.7
(3)              thrust                                       < 0.95
(4)              cos(θ_thrust)                                < 0.96
(5)    105°    < angle between jet 1 and 3                    < 165°
(6)    70°     < angle between jet 2 and 4                    < 160°
(7)    110 GeV < invariant mass of Higgs candidate after fit  < 140 GeV
(8)    80 GeV  < invariant mass of Z candidate after fit      < 110 GeV
(9)              highest reconstructed photon energy          < 10 GeV

TABLE II. Selections for the hadronic channel. The jets are ranked in decreasing order of energy.
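The hadronic-channel pre-selection amounts to a list of window cuts, which can be written in a data-driven way. The cut values below are those of the hadronic channel selection table; the variable names, the dictionary-based event record and the use of strict inequalities are assumptions of this sketch, and the thrust-axis cut is applied to the absolute value of the cosine, following the description of that variable in the text.

```python
# Each entry: (hypothetical variable name, lower bound or None, upper bound or None)
HADRONIC_CUTS = [
    ("n_tracks_per_jet_min", 4, None),
    ("minus_log_ymin", None, 2.7),
    ("thrust", None, 0.95),
    ("abs_cos_theta_thrust", None, 0.96),
    ("angle_jet1_jet3_deg", 105, 165),
    ("angle_jet2_jet4_deg", 70, 160),
    ("m_higgs_fit_gev", 110, 140),
    ("m_z_fit_gev", 80, 110),
    ("max_photon_energy_gev", None, 10),
]


def passes_cuts(event, cuts=HADRONIC_CUTS):
    """Return True if every variable lies strictly inside its allowed window."""
    for name, low, high in cuts:
        value = event[name]
        if low is not None and not value > low:
            return False
        if high is not None and not value < high:
            return False
    return True


def failed_cuts(event, cuts=HADRONIC_CUTS):
    """Return the names of all cuts the event fails, useful for cut-flow tables."""
    return [
        name
        for name, low, high in cuts
        if (low is not None and not event[name] > low)
        or (high is not None and not event[name] < high)
    ]
```

Here `n_tracks_per_jet_min` stands for the smallest track multiplicity among the jets, so that requiring it to exceed 4 enforces the cut on every jet.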





                              SM          H → cc    H → bb    H → gg
Neutrino Channel   Before     45936973    637       11580     986
                   After      109057      506       6707      759
Hadronic Channel   Before     39398366    1837      30985     2965
                   After      967312      947       15805     1611

TABLE III. Number of events before and after pre-selections
in the neutrino and hadronic modes.



Decay Mode      Neutrino              Hadronic
                NN1 >     NN2 >       NN1 >     NN2 >
cc              0.33      0.38        0.36      0.29
bb              0.81      0.92        0.64      0.88
gg              0.71      0.87        0.10      0.27

TABLE IV. Neural net selections in the neutrino and hadronic
channels.



The inclusive Higgs-strahlung cross section used for this normalisation was σ_ZH = 209.0 ± 9.8 fb, as determined in an independent recoil mass analysis performed for the SiD Letter of Intent [7]. The branching ratio is then given by

\begin{equation}
BR(H \to f\bar{f}) = \frac{\sigma_{H \to f\bar{f}}}{\sigma_{ZH} \times BR(Z \to q\bar{q}, \nu\bar{\nu})}
\tag{2}
\end{equation}

where $\sigma_{H \to f\bar{f}}$ is the signal cross section and $BR(Z \to q\bar{q}, \nu\bar{\nu})$ is the decay branching ratio of the Z boson into two jets or neutrinos, depending on the channel. The $BR(Z \to q\bar{q}, \nu\bar{\nu})$ values were taken from the generator tables. The cross section was calculated as follows:

\begin{equation}
\sigma_{H \to f\bar{f}} = \frac{N}{\varepsilon L}
\tag{3}
\end{equation}

where N was the number of signal events after all selections, ε was the total signal efficiency and L was the integrated luminosity. The cross section uncertainty took into account statistical fluctuations in the number of both signal and background events. The calculation of BR(H → gg) follows exactly the same procedure.
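The two formulas above translate directly into code. The sketch below implements the cross section N/(εL) and the branching ratio σ/(σ_ZH × BR(Z)), together with a statistical uncertainty on the cross section. The form √(S+B)/S for the relative statistical uncertainty is an assumption of this sketch, although it is consistent with the quoted results: for the cc neutrino channel (178 signal, 140 + 109 background events) it gives 11.6%, or about 0.8 fb on 6.8 fb. Function names are hypothetical, and the numbers in the usage example are toy values rather than results of the analysis.

```python
import math

SIGMA_ZH_FB = 209.0  # inclusive Higgs-strahlung cross section quoted in the text


def cross_section_fb(n_signal, efficiency, luminosity_fb):
    """Signal cross section N / (efficiency * L)."""
    return n_signal / (efficiency * luminosity_fb)


def branching_ratio(sigma_signal_fb, br_z, sigma_zh_fb=SIGMA_ZH_FB):
    """Higgs branching ratio sigma_signal / (sigma_ZH * BR(Z))."""
    return sigma_signal_fb / (sigma_zh_fb * br_z)


def cross_section_stat_error_fb(sigma_signal_fb, n_signal, n_background):
    """Absolute statistical error, assuming a relative error of sqrt(S + B) / S."""
    return sigma_signal_fb * math.sqrt(n_signal + n_background) / n_signal


if __name__ == "__main__":
    # Toy inputs chosen only to exercise the functions.
    sigma = cross_section_fb(n_signal=100, efficiency=0.25, luminosity_fb=250.0)
    print(f"sigma = {sigma:.2f} fb")
    print(f"BR    = {branching_ratio(sigma, br_z=0.2, sigma_zh_fb=200.0):.3f}")
```

Both the signal and the background counts enter the uncertainty because the number of selected events fluctuates as a whole, even though only the signal enters the central value.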

A summary of the results obtained for the neutrino and hadronic channels is given in Table V. It shows the number of events after all selections, the signal efficiency, the cross section and the branching ratios with their uncertainties for each channel separately and in combination. The cross sections and branching ratios determined for the combination of the neutrino and hadronic channels, and their relative uncertainties, were calculated assuming that the two channels were statistically independent. The uncertainty of the cross section was considered to be of purely statistical nature, while the uncertainty of the branching ratio also took into account the total Higgs-strahlung cross section uncertainty, which is correlated between the two channels and is the largest source of systematic uncertainty in the analysis. Other systematic uncertainties originating from BR(Z → qq, νν) and luminosity uncertainties are of the order of 0.1% and were neglected. The efficiency was derived using larger signal samples and was assumed not to contribute to the uncertainty. Systematic uncertainties originating from the detector properties, reconstruction algorithms and the polarization measurement uncertainty were not addressed in this study, though they are expected to be small.
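The combination of two statistically independent channels and the addition of the correlated normalisation uncertainty can be sketched as follows. The inverse-variance weighted average is an assumption of this example, since the text does not name the combination method, but it reproduces the combined cc cross section of 6.86 ± 0.48 fb from the per-channel values 6.8 ± 0.8 fb and 6.9 ± 0.6 fb. Adding the relative uncertainty of σ_ZH (9.8/209.0) in quadrature to the combined relative cross section uncertainty likewise gives 8.4%, 4.8% and 12.2% for cc, bb and gg.

```python
import math


def combine_independent(measurements):
    """Inverse-variance weighted mean of independent (value, error) pairs.

    Returns (combined_value, combined_error).
    """
    weights = [1.0 / (err * err) for _, err in measurements]
    total = sum(weights)
    mean = sum(w * value for w, (value, _) in zip(weights, measurements)) / total
    return mean, 1.0 / math.sqrt(total)


def add_in_quadrature(*relative_uncertainties):
    """Combine uncorrelated relative uncertainties in quadrature."""
    return math.sqrt(sum(r * r for r in relative_uncertainties))


if __name__ == "__main__":
    value, error = combine_independent([(6.8, 0.8), (6.9, 0.6)])
    print(f"combined sigma(H->cc) = {value:.2f} +/- {error:.2f} fb")
    rel_zh = 9.8 / 209.0
    for mode, rel_sigma in (("cc", 0.070), ("bb", 0.011), ("gg", 0.113)):
        print(f"{mode}: relative BR uncertainty = {add_in_quadrature(rel_sigma, rel_zh):.3f}")
```

The normalisation term is added after the channel combination, rather than per channel, because it is common to both channels and does not average down.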



The branching ratios were measured as 3.3%, 68.2% and 
7.4% for cc, bb and gg respectively. It was found that 
deviations of the determined branching ratios from the 
values used in the MC generator were within the statis- 
tical uncertainties in all cases implying that the analysis 
did not bias the measurement. The relative uncertain- 
ties on the branching ratios were determined to be equal 
to 8.4%, 4.8% and 12.2% for cc, bb and gg respectively. 
For the cc and gg measurements, the relative uncertainty 
of the signal cross section dominated the uncertainty of 
the branching ratio. However, for the main decay chan- 
nel, bb, the branching ratio uncertainty was dominated 
by the uncertainty on the inclusive Higgs-strahlung cross 
section used for normalization. 

The obtained results were found to be largely consistent with results from earlier studies in [25], [26] and [6], which all used simplified approaches either to the simulation, or reconstruction, or both. The deterioration of sensitivity due to the added realism was recovered with a new signal discrimination technique which utilizes correlations of two neural networks.



FIG. 4. LCFI flavour tag distributions (number of events versus neural network output) for the leading jet in the hadronic channel for the cc decay mode: (a) b-tagging variable, (b) c-tagging variable and (c) c-tagging variable for the case of b-background only. Solid curves are the SM background, dashed curves are the Higgs background sample and filled histograms are the signal. All histograms are normalized to 250 fb⁻¹ and are shown after preselections.



V. SUMMARY



The sensitivity to the decay branching ratios of a neutral 120 GeV SM Higgs boson to charm quarks, bottom quarks and gluons has been studied at the ILC at a centre-of-mass energy of √s = 250 GeV and an integrated luminosity of 250 fb⁻¹. The analysis is based on full simulation and realistic event reconstruction in the SiD detector and also fully accounts for the standard model background processes. The relative uncertainties obtained are comparable to values obtained in some of the previous studies. Good performance of flavour tagging and the use of neural networks in event selection were critical in obtaining these results. The uncertainties on the branching ratios were found to be equal to 8.4%, 4.8% and 12.2% for cc, bb and gg, respectively. The uncertainty in the bb branching ratio is dominated by the uncertainty on the inclusive Higgs-strahlung cross section.

FIG. 5. Neural network outputs in the cc decay mode for the neutrino channel: (a) first NN and (b) second NN. Solid curves are the SM background, dashed curves are the Higgs background sample and filled histograms are the signal. All histograms are normalized to 250 fb⁻¹.





                                      Neutrino          Hadronic          Combined

H → cc
Signal events                         178               407
SM background events                  140               673
Higgs background events               109               213
Signal efficiency (%)                 27.9              22.2
Signal σ(H → cc)                      6.8 ± 0.8 fb      6.9 ± 0.6 fb      6.86 ± 0.48 fb
Relative uncertainty on σ(H → cc)     11.6%             8.8%              7.0%
Higgs BR                              3.3 ± 0.4%        3.3 ± 0.3%        3.3 ± 0.3%
Relative uncertainty on Higgs BR      12.5%             10.0%             8.4%

H → bb
Signal events                         2833              8122
SM background events                  220               4700
Higgs background events               55                423
Signal efficiency (%)                 24.5              26.2
Signal σ(H → bb)                      142.7 ± 2.3 fb    142.5 ± 1.9 fb    142.57 ± 1.61 fb
Relative uncertainty on σ(H → bb)     1.9%              1.4%              1.1%
Higgs BR                              68.3 ± 3.4%       68.2 ± 3.3%       68.2 ± 3.3%
Relative uncertainty on Higgs BR      5.0%              4.9%              4.8%

H → gg
Signal events                         32                524
SM background events                                    3621
Higgs background events               4                 1431
Signal efficiency (%)                 3.3               17.7
Signal σ(H → gg)                      15.1 ± 1.9 fb     15.6 ± 2.6 fb     15.41 ± 1.74 fb
Relative uncertainty on σ(H → gg)     18.7%             14.2%             11.3%
Higgs BR                              7.2 ± 1.4%        7.5 ± 1.1%        7.4 ± 0.9%
Relative uncertainty on Higgs BR      19.3%             15.0%             12.2%

TABLE V. Results for the H → cc, H → bb and H → gg decay modes.